OVM: Out-of-Order Execution Parallel Virtual Machine

نویسندگان

  • George Bosilca
  • Gilles Fedak
  • Franck Cappello
چکیده

High performance computing on parallel architectures currently uses different approaches depending on the hardware memory model of the architecture, the abstraction level of the programming environment and the nature of the application. In this paper; we introduce an original client-server execution model based on RPCs called Outof-order parallel Virtual Machine (OVM). OVM aims to provide three main features: portabilitythrough a unique memory model, load-balancing using a plug-in support and high performance provided by several optimizations. The main optimizations are: non-blocking RPCs, data flow management, persistent and non persistent data, static data set distribution, dynamic scheduling and asynchronous global operations. We present OVM general architecture and demonstrate high performance for regular parallel applications, a parallel application with load balancing needs and a real time parallel application.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pii: S0167-739x(01)00071-1

6 High performance computing on parallel architectures currently uses different approaches depending on the hardware memory model of the architecture, the abstraction level of the programming environment and the nature of the application. In this article, we introduce an original client–server execution model based on RPCs called out-of-order parallel virtual machine (OVM). OVM aims to provide ...

متن کامل

Execution-Driven Distributed Simulation of Parallel Architectures

A new methodology for the asynchronous discrete event-driven simulation of parallel computers is proposed. This methodology integrates sequential and distributed simulation in a unified paradigm and is applicable to the simulation of all classes of parallel computer architectures. In our own simulation work we accelerated simulations by more than an order of magnitude with parallel execution sp...

متن کامل

A predecoding technique for ILP exploitation in Java processors

Java processors have been introduced to offer hardware acceleration for Java applications. They execute Java bytecodes directly in hardware. However, the stack nature of the Java virtual machine instruction set imposes a limitation on the achievable execution performance. In order to exploit instruction level parallelism and allow out of order execution, we must remove the stack completely. Thi...

متن کامل

Data-Driven Multithreading Programming Tool-chain

The increasing parallelism offered by the parallel architectures introduced by processor vendors, coupled with the need to extract more parallelism out of the applications, has led the community to examine more efficient programming and execution models. The Dataflow Multithreading model is known to be the model that can exploit the most parallelism out of a wide range of applications. The Data...

متن کامل

H ABILITATIONSSCHRIFT Time - predictable Computer Architecture

Architectural advancements in modern processor designs increase average performance with features such as pipelines, caches, branch prediction, and out-of-order execution. However, these features complicate worst-case execution time analysis and lead to very conservative estimates. JOP (Java Optimized Processor) tackles this problem from the architectural perspective – by introducing a processo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001